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Abstract 

It has been suggested that mitochondrial dysfunction plays a role in metabolic disorders including obesity, diabetes, and 
hypertension. The fact that mitochondrial defects can be accumulated over time as a normal part of aging may explain why 
some individuals can eat all sorts of foods and remain at normal weight while they are young. However, around the fourth 
decade of life there is a trend towards "middle-age spread" with weight gain and the body's decreasing ability to 
metabolize calories efficiently. To test the hypothesis that mitochondrial variants are associated with BMI in adults, we 
analyzed a total number of 984 mitochondrial single nucleotide polymorphisms (mtSNPs) in a sample of 6,528 individuals 
participating in the KORA studies. To assess mtSNP association while taking heteroplasmy into account we used the raw 
signal intensity values measured on the microarray and applied linear regression. Significant results were obtained for 2 
mtSNPs located in the Cytochrome c oxidase subunit genes (MT-COl: Padjusted = 0.0140 and MT-C03: Pgdjusted = 0.0286) and 3 



mtSNPs located in the NADH dehydrogenase subunit genes {MT-NDl, MT-ND2 and MT-ND4L: P 



adjusted ' 



: 0.0286). 



Polymorphisms located in the MT-C03 and MT-ND4L genes have not been associated with BMI or related phenotypes in 
the past. Our results highlight the importance of the mitochondrial genome among the factors that contribute to the risk of 
high BMI. Focusing on mitochondrial variants may lead to further insights regarding effects of existing medications, or even 
to the development of innovative treatments. 
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Introduction 

Excess of weight is a major risk factor for mortality and 
morbidity from cardiovascular disease [1], type 2 diabetes [2], and 
incident cancer [3] . Tliree million deaths per year worldwide are 
attributed to overweight and obesity in total [4]. Obesity is a 
medical condition in which excess body fat has accumulated to the 
extent that it may have an adverse effect on health [5]. Anyway, 
the conflation of lean and fat body mass becomes inconsistent in 
the elderly with both muscle mass and strength begining to decline 
as part of aging [6] . Changes such as accumulation of intra- and 
extra-myocellular lipids, improper folding of structural and 
contractile proteins, and mitochondrial dysfunction are thought 
to occur with age [7-9] . Dysfunctional mitochondria are thought 
to play a key role in function decline with aging, as the 
mitochondria are the main producers of both cellular energy 
and free radicals. 



Mitochondria are membrane-bound organelles found in the 
cytoplasm of almost aU eukaryotic cells. Their primary function is 
to generate large quantities of energy in the form of adenosine 
triphosphate (ATP). In the early 1960s it was discovered that 
mitochondria contain their own DNA (mtDNA) of approximately 
16.6 kb. It is known to code for 13 subunits of the mitochondrial 
respiratory chain complexes, 2 ribosomal (rRNA) genes, and 22 
transfer RNA (tRNA) genes that are required for mitochondrial 
protein synthesis. Mitochondria consume oxygen and substrates to 
generate the vast majority of ATP while producing reactive 
oxygen species (ROS), also known as free radicals, in the process. 
An excess of ROS may damage DNA, proteins, and lipids if not 
rapidly quenched. This damage is termed oxidative stress. It has 
been recently recognized that oxidative stress is increased (ROS 
overproduction) in obesity [10,11] and in fat accumulation in 
adipocytes [12]. The 13 structural genes are essential for energy 
production through the process of oxidative phosphorylation 
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(OXPHOS) and generation of ATP. The activity of the OXPHOS 
system that consists of five enzyme complexes (I-V) can be affected 
by a number of factors including age and physical training [13]. In 
addition to supplying ATP and involvement in oxidative stress, 
mitochondria also participate in other cellular processes, including 
signal transduction, cell cycle regulation, thermogenesis, and 
apoptosis. mtDNA is maternally inherited and lacks recombina- 
tion [14]. One major difference between mtDNA and nuclear 
DNA (nDNA), which is relevant for the understanding of human 
diseases, is the heteroplasmy effect, which was originally believed to 
be a rare phenomenon. Because a cell carries many mitochondria, 
and also the mitochondrial genome has a higher mutation rate 
than the nuclear genome, there is heterogeneity of the relative 
frequency of a mtDNA variant within an individual tissue, cell, 
and even within the same mitochondrion. Therefore, mitochon- 
dria are considered heteroplasmic. The chnical expression of some 
phenotypes is determined by the relative proportion of normal and 
mutant mitochondrial genomes in different tissues. Mutations of 
mtDNA are under a growing scientific spotlight and there is 
increasing evidence that these mutations play a central role in 
many, if not most, human diseases. 

In the past years, genome-wide association studies (GWASs) 
have identified over 90 common genetic variants associated with 
BMI in adult samples [15-19]. A growing body of research is 
demonstrating that altered mitochondrial energy production is a 
major anomaly capable of setting off a chain of metabolic events 
leading to obesity [20-23]. It has also been suggested that 
particular mtDNA haplogroups might be associated with ineffi- 
cient energy expenditure [24] and obesity [25]. A mtDNA 
haplogroup is a collection of mtSNPs at certain genetic loci 
accumulated throughout human history that could be attributed to 
genetic drift and/or climate selection [26]. The fact that 
mitochondrial defects can be accumulated over time as a normal 
part of aging may explain why some individuals can eat all sorts of 
foods and remain at normal weight while they are young. 
However, around the fourth decade of life there is a trend towards 
"middle-age spread" with weight gain and the body's decreasing 
ability to metabolize calories efficiently. In two newly published 
studies in animals, researchers show how the dynamics of the 
mitochondria play a key role in the body's ability to control weight 
gain [27,28]. Hence, whether one becomes obese or remains lean 
can depend on the dynamics of the mitochondria, the body's 
energy-producing "battery". 

The purpose of the current study was to conduct a mitochon- 
drial GWAS to identify genetic variants influencing BMI. In 
particular, we tested 984 mtSNPs in a sample of 6,528 adults, aged 
24-85 years. 

Results 

We performed mitochondrial genome wide association analysis 
separately for each genotyping chip and corresponding sample (see 
Table 1). After QC, a total number of 984 mtSNPs were included 
in the analysis. To assess mtSNP association while taking 
heteroplasmy into account, we used the raw signal intensity values 
that were measured on the microarray and applied the linear 
regression method as described in the Materials and Methods 
section. The resulting p-values after adjustment for multiple testing 
are plotted in Figure 1 for each genotyping chip. The x-axis 
represents the mitochondrial genome, showing the position and 
relative size of each of the 13 major mitochondrial genes. 

Four and one mtSNPs were significant for ASymetrix 6.0 and 
lUumina Exome chip, respectively. No significant mtSNPs were 
observed when analysing the Axiom and Metabochip. The association 
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Figure 1. Mitochondrial genome-wide /'values after adjusting for multiple testing. On the y axis, adjusted p-values transformed into the 
negative of the base 1 0 logarithm, — log,o(p-value), are shown. The x-axis represents the mitochondrial genome, displaying the position and relative 
size of each of the 13 major mitochondrial genes, 12S and 16S rRNAs, hypervariable region 1 (HVRI), hypervariable region 2 (HVR II) as well as the 
position of the 22 tRNAs (gray). The dashed lines show the critical values of the pointwise significance level corresponding to an FDR of 0.05. 
doi:10.1371/journal.pone.0105116.g001 



results that remained significant after adjustment for multiple testing 
(Padjusted— 0.05) are presented in Table 2. The two most significant 
mtSNPs associated with BMI are located in the Cytochrome c oxidase 1 
and 3 (MT-COi^^tesesA-^G and Mr-C05,„,9698T-^c) genes of 
complex rV, with associated adjusted p-values of 0.014 and 0.029, 
respectively. Another three locations NADH submit dehydrogenase 
genes of complex I were also significant (MT-ND1^^,,^qy^o'- 
Padjusted = 0.029; Mr-ArZ)2„„485ic^T: Padjus.-d = 0.029; MT- 
iVZ)4L„tio.^soA-^G: Padjusted = 0.029). In all significant mtSNPs the 
estimates of and P^jg,, were signiGcant and positive, indicating a 
significantly higher BMI in males and with older age. AH five 
significant mtSNPs were present only on one chip without 
overlapping, i.e., they were not shared by the other chips. 

A negative parameter estimate for the mtSNP (Psnp<0) 
indicates that the risk allele is the minor allele while a Psnp>0 
indicates that the risk allele is the major allele. Taking the most 
strongly associated variant mt6663A-^G in the MT-COl gene. 



based on our estimate (Pmt6663A-^c; = — 0.55±0.1) BMI increases 
when the ratio of alleles A and G decreases, i.e., an increase in 
heteroplamy with the G allele at this locus leads to an increased 
BMI. The same applies to the other mtSNPs; e.g. having 
mtl0550A-^G heteroplasmy with more G than A alleles results 
in a higher BMI than having only A alleles. The estimates of the 
model parameters for each significant mtSNP are provided in 
Table SI. 

Discussion 

Excess of weight is a medical condition in which excess body fat 
has accumulated to the extent that it may have an adverse effect 
on health, leading to reduced life expectancy and/or increased 
health problems. The possible role of mitochondria in the 
development of obesity is mainly concerned with the efficacy of 
mitochondrial energy provision [29,30]. Apparently, the balance 



PLOS ONE I www.plosone.org 



3 



August 2014 | Volume 9 | Issue 8 | e105116 



mtSNPs Associated with BIVll in Adults 



between energy conservation into the ATP molecule and energy 

'M dissipation in the form of heat loss thus represents the key 

g metabolic control, which is crucial for possible development of 

I obesity. Obesity, as well as fat accumulation in adipocytes, has 

£ been related to an increased ROS production in human 

mitochondria [10-12]. In the current study, in which we 

^ examined association between BMI and mtSNPs, we identified 

S two mtSNPs (mt6663A-^G, mt9698T^G) located in Cytochrome 

5 c oxidase subunits (MT-COl and MT-C03) and three mtSNPs 

.£ (mt3336T^G, mt4851C^T, mtl0550A-^G) located in NADH 

I subunits {MT-NDl, MT-ND2, and MT-ND4L), that were 

^ significantly associated with BMI after adjusting for multiple 

™ testing. Although mtDNA is transmitted from mother to offspring 

unchanged (lack of recombination), due to the higher mutation 

rate in mtDNA compared to nDNA, many mutations have 

S occurred only recently rather than many generations ago, and 

J even in the current generation. For that reason pairwise LD is 

Q reduced, and when one mtSNP is significant it is not expected that 

^ the neighboring mtSNPs are also significant. 

to 

1 MT-COl ^,e663A-^G and MT-C03^^ge9aj^c 

Mitochondrially encoded cytochrome c oxidase sub- 

y units, complex IV, is a key oxidative enzyme regarded as one of 

2 the major regulation sites for the OXPHOS system, controlled by 
i both iiDNA and mtDNA. Its catalytic activity is primarily 
!S determined by 3 of the 13 subunits which are encoded by the 
E ^ mtDNA {MT-COl, MT-C02, and MT-C03) [31]. The loss of 
£ 2 function of this enzyme has been suggested to trigger ROS 
c 2 production, although the increase in radical accumulation rests not 
I i with the ETC but with non-mitochondrial sources [32]. However, 
^ the function of each subunit and the molecular mechanism behind 
g "S the regulation on the activity of this iinportant protein complex are 
§ largely unknown [33]. The interesting finding that suppression of 
z a cytochrome c oxidase activity also sensitizes cells to death signals 
=g [33] could shed some light on the role of defective cytochrome c 
g ^ oxidase in aging and age-related diseases as an increase of BMI 
^ "= with age. 

i I MT-NDl ^asseT^c, MT-NDI^^^ssic-^i, and MT- 

OSSOA^G 



v. i 



E £ 



- (13 



E I Mitochondrially encoded NADH dehydrogenase sub- 

g ^ unit genes, complex I, is the first enzyme in the mitochondrial 

S 1 respiratory chain. It extracts energy from NADH, produced by the 

E ffi oxidation of sugars and fats, and traps the energy in a potential 

diflFerence or voltage across the mitochondrial inner membrane. 
The potential difference is used to power the synthesis of ATP, 
Because NADH dehydrogenase is central to energy production in 
the cell, its malfunction may result in a wide range of metabolic 
disorders. Some of them are due to mutations in the mitochondrial 
genome, while others, which result from a decrease in the activity 
of complex I, or an increase in the production of ROS, are not yet 
well understood. 

^ Ji g MT-NDl and MT-ND2 genes were previously reported to be 

related with body fat mass and intramuscular fat in animals 
[23,34,35]. The significant findings in our study at mt3336T^G 
{MT-NDl) and mt4851C^T {MT-ND2) have associated effect 
sizes estimated to be —0.23 and —1.11, respectively. In other 
words, increasing the number of T alleles (or decreasing G alleles) 
at mt3336 in an individual will help to reduce the BMI, while 
!t ^ increasing the number of T alleles (or decreasing C alleles) at 

-g ^ mt485 1 will contribute to increase BMI. In this study we identified 

for the first time a significant association between MT-ND4L 
(mtl0550A^G) and BMI, with an estimated effect size of -0.31. 
Therefore, the G allele is the one to be related with high BMI 
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values, i.e., people with heteroplasmy having a large number of G 
alleles will have a higher BMI than people with a smaller number 
of G alleles. MT-ND4L is one of the 7 mtDNA-encoded subunits 
included among the approximately 41 polypeptides of respiratory 
complex I. MT-ND4L is probably a component of the hydropho- 
bic protein fragment of the complex [36]. 

While associations of the MT-COl gene with intramuscular fat 
in animals have been reported [35], no significant findings have 
been reported so far for the MT-C03 gene with BMI, nor with 
other BMI-related phenot)'pes. Liu et al. [37] suggested an effect 
of mtSNPs, including MT-C02, MT-NDl, and MT-ND2 genes, 
on BMI but this effect failed to reach significance after adjusting 
for multiple testing. Similarly, Saxena et al. [38] detected nominal 
signals of association between BMI and mtSNPs, including MT- 
NDl, MT-ND2, and MT-ND3 genes; however after adjusting for 
multiple testing these associations disappeared. 

Recently, Knoll et al, 2014 [39] did not find association 
between mtSNPs and obesity in children. However, this may be 
because heteroplasmy was not included in their study or difierent 
mtSNPs might be relevant for children than adults with regard to 
BMI. 

Two of the significant variants MT-NDl ^^y^jfj-Y^G <md MT- 
COl,.., 'U'" missense mutations which lead to an amino 
acid change, thus being a non-synonymous variant. So, individuals 
with an excess of missense mutations may carry an appreciable 
fraction of an altered protein that is responsible for an elevated 
BMI. The other three significant variants MT-ND2^i^<^-^iq,_^-y, 
Mr-C03n„969sT^c, and MT-ND4Lj„t\Q''',:MA-^G ^ire synonymous 
mutations, i.e., they code for the same amino acid. How an excess 
of synonymous mutations at these loci could impact BMI needs 
further investigation, since the single nucleotide change leads to an 
unchanged protein. However, different codons might lead to 
dilferential protein expression levels. Two of the synonymous 
variants, mt9698C and mtl0550G, are present in the European 
mtDNA haplogroups U8 and K, respectively. However, to date 
only haplogroup T has been suggested as a risk factor for obesity 
[24]. 

These results together with previously pubhshed findings suggest 
that genetic variation in NADH dehydrogenase and cytochrome c 
is a risk factor for several metabolic disorders, including BMI and 
BMI-related phenotypes. Multiple lines of evidence suggest that 
mitochondrial dysfunction as well as altered mitochondrial 
genome expression, particularly of genes encoding complex I-IV 
may play a role in BMI. Searching for previous suggested 
candidate mtSNPs associated with BMI or related phenot)'pes has 
yielded a small number of positive findings, possibly due to low 
sampk- siz(;s or th<; lack of taking heteroplasmy into account in the 
analysis. In this study we have considered the heteroplasmy effect 
and found mtSNPs in candidate gene regions to be significantly 
associated with BMI that previously were suggested to be 
implicated in obesity, body fat mass, or intramus(;:ular fat (MT- 
NDl, MT-ND2, and MT-COl). In addition, two novel suscep- 
tibility mtSNPs in the MT-ND4L and in the MT-C03 regions 
have been identified. In both polymorphisms heteroplasmic 
individuals towards a larger number of the respective minor allele 
increases the risk of developing a high BMI. 

The regression models for all significant mtSNPs identified in 
this study also corroborate the generally acknowledged fact that 
males have slighdy but significant higher BMI values than females 
and BMI also increases significantiy with older age. With repond 
to sex, this effect is to be expected due to the fact that adult males 
and females with the same BMI are classified identically in regard 
to whether their weight is normal. However, even though men 
may have slightly higher BMI values than women, this does not 



necessarily mean that they are more overweight. For instance, 
women typically have more body fat than men, whereas men 
typically have more muscle than women, and muscle has a much 
greater density (it takes up less volume than an equal mass of fat). 
It is also known that BMI is largely influenced by age in adults 
[40,41]. It has been suggested that BMI increases linearly with age 
in women, but in men it increases with age in two stages - a more 
intensive rise between age 20-40 and a much slower increase 
between age 40-60 [42] . High BMI can occur at any age, but with 
both muscle mass and strength beginning to decline around the 
age of 40, an increased BMI at this age may be related to 
mitochondrial dysfunction. 

Conclusion 

In summary, our study reports genetic variants located in the 
cytochrome c subunits 1, 3, and in the NADH dehydrogenase 
subunits 1, 2, and 4L of the mtDNA that are significantiy 
associated with BMI in adults. Heteroplasmy in these variants 
towards the minor allele increases the risk of developing high BMI. 
Based on these findings we hypothesize that common metabolic 
diseases as obesity or high BMI are attributable at least in part to 
mitochondrial polymorphisms. Animal and human data consis- 
tently show that mitochondria are altered in aging, leading to 
increased mutations in mtDNA, decreased expression of some 
mitochondrial proteins, reduced enzyme activity, and altered 
respiration with reduced maximal capacity in sedentary adults. 
Since the primary role of mitochondria is to produce ATP to 
maintain the energy status of the cell, shifts in respiratory activity 
and capacity can lower the membrane potential, reduce cellular 
ATP concentration, and signal cellular apoptotic events. Increased 
apoptosis without correspondingly increased protein synthesis will 
eventually lead to net muscle fiber loss. All of these factors are 
likely to contribute to age-associated BMI, and mounting evidence 
suggests that most of these age-related changes can be either 
prevented or attenuated through increased physical activity. There 
are some thoughts that the accumulation of oxidative damage 
caused by long-term ROS production is responsible for these 
changes with age. Cytochrome c and NADH dehydrogenase 
complexes are both involved in the regulation of mitochondrial 
ROS. Indeed, blocking ROS formation in animals by the targeted 
expression of antioxidant enzymes ameliorates age-associated 
dysfunction and returns mitochondrial parameters to those of 
young animals [43] . Whether this is also true for humans remains 
to be shown. However, behavioral factors that may improve 
mitochondrial function and BMI control, such as physical activity 
and caloric restriction, also reduce ROS production and increase 
antioxidant defenses. Although physical activity training in elderly 
adults probably does not completely reverse the primary effects of 
aging, it may clearly reduce the rate of mitochondrial decline and 
attenuate the aging processes [44] . 

These findings highlight the important role of the mtDNA 
among the factors that contribute to the risk of high BMI in adults 
and suggest that variants in the mitochondrial genome may be 
more important than has previously been suspected. Therefore, a 
challenge for possible novel therapeutic strategies might be the 
further elucidation of intracellular ROS signaling pathways for 
effective therapeutic gain. 

Materials and MetKiods 

Study design and population 

The Cooperative Health Research in the Region of Augsburg 
(KORA) study is a series of independent population-based 
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epidemiological surveys and follow-up studies of participants living 
in the region of Augsburg, in southern Germany, in a mixed urban 
and rural area with demographic and socioeconomic character- 
istics roughly reflecting those of an average central European 
population. All participants are residents of German nationality 
identified through the registration office and written informed 
consent was obtained from each participant [45]. The study was 
approved by the local ethics committee (Bayerische Landesarzte- 
kammer). All participants filled in a self-administrated question- 
naire and underwent a standardized personal interview and an 
extensive medical examination. The study design, sampling 
method, and data collection have been described in detail 
elsewhere [46]. The most recent KORA studies are KORA S4 
(1999-2001) with its follow-up KORA F4 (2006-2008) and 
KORA S3 (1994-1995) with its follow-up KORA F3 (2004- 
2005). The present study includes data of the studies F3, F4, and 
additionally from those individuals of S4 that did not participate in 
F4, including a total number of 6,528 unrelated individuals. In 
order to avoid confounding with insulin-dependent diabetes 
meUitus, 247 individuals diagnosed as diabetic were not included 
in the study. 

BMI phenotype 

The body mass index (BMI), or Qtietelet index (devised between 
1830 and 1850 by the Belgian polymath Adolphe Quetelet), is a 
measure for human body shape defined as the individuafs body 
weight divided by the square of the height (kg/ m"). AH participants 
were subjected to several medical examinations including BMI 
measurement. 

Genotyping 

DNA was extracted from fuU blood after the blood draw and 
then stored at — 80°C. Genotyping was performed using different 
platforms such as the Affymetrix 6.0 GeneChip array (KORA F4), 
lUumina MetaboChip 200K (KORA F3 and F4), lUumina Human 
Exome Beadchip array (KORA F4), and Affymetrix Axiom chip 
array (KORA S4 and F4). A total number of 6,528 individuals 
were genotyped. Some individuals of KORA F4 were present on 
more than one chip, some of them (1,549) were included on all 
chips. Only single-nucleotide polymorphisms located in the 
mitochondrial genome (mtSNPs) were considered in this study. 
Genotyping data and samples are summarized in Table 3. Most of 
the covered mtSNPs have distinct positions identified by different 
chips. Although the Affymetrix 6.0 is the one containing the largest 
number of mtSNPs some regions are not well covered. The 
lUumina Metabochip contains the smallest number of mtSNPs and 
many regions are uncovered, especially the hypervariable regions 
(HVRI and HVRII) as well as the CO 1 and C02 genes. However, 
when all chips are considered together, good overall coverage of 
the mitochondrial genome is obtained (Figure 2). 

Table 3. Distribution of characteristics of the study population. 



Genotype calling 

Current SNP microarrays are able to genotype more than a 
million SNPs simultaneously. There are some statistical issues such 
as the normalization of raw intensities, background correction, 
outlier detection, and allele calling (genotyping) that are shared 
between different microarray platforms irrespective of chip 
technology and design principles. The most commonly used 
platforms, Affymetrix and lUumina, represent a SNP by a number 
of probes (Affymetrix) or beads (lUumina) for each of the two 
aUeles. The probes/beads have different affinities, depending on 
the DNA sequence they target. A first step in many algorithms for 
genotyping is to cluster the probe intensities for each allele and 
SNP, and in a second step to make a caU based on the clustered 
intensities. 

The allele frequency, one of the most important population 
indices used in genetics, denotes the relative frequency of a certain 
aUele at a marker locus. There are two kinds of allele frequencies: 
individual-level that represents the within-individual relative 
frequency of aUeles and its standard error reffects inter-ceU 
variabUity in an individual, and population-level that refers to the 
within-population relative frequency of aUeles and its standard 
error reUects inter-individual variability in a population [47]. 
Using SNPs, both individual and popvUation aUele frequencies can 
be estimated using an aUele counting approach (genotype-based) 
and using an intensity measuring approach (intensity based). A 
problem arises, however, when dealing with mtSNPs due to the 
heteroplasmy effect, causing heterogeneous mtDNA in the sense 
that different mitochondria can have different genotypes, such that 
a genotype at an mtDNA locus may not be restricted to 0, 1, or 2 
minor alleles. This issue affects the possibility of estimating 
genotypes and makes calling algorithms useless. Therefore, 
whenever one intends to identify susceptibility genes located in 
the mtDNA we recommend accounting for heteroplasmy using 
individual-level allele frequencies obtained from the intensity 
values. 

Quality control 

First a visual check was performed plotting the intensity 
measures of each mtSNP separately for each chip. In this step 
only the Axiom chip displayed anomalies in some intensity values. 
There seems to be an upper bound for the intensity values at an 
intensity of 4095. As a result some mtSNPs had a significant 
fraction of either their A or B intensities squeezed into the region 
just below that upper bound; apparently without the bound those 
intensities would have taken higher values. Since intensities close 
to the cut-off value of 4095 cannot be trusted, we proceeded as 
described in the Text SI. 

After the visual check, normalization of microarray data and 
outiier corrections were performed separately for each chip as 



Chip 


mtSNP 


KORA 


Individuals (IVIales/Females) 


Mean age Males/Females 


Mean BMI Males/Females 


Affy. 6.0 


465 


F4 


1646 (788/858) 


60.4±8.8/60.1±8.7 


28.2±4.3/27.5±4.9 


Affy. Axiom 


252 


S4/F4 


3530 (1702/1828) 


54.1±14.2/53.6±13.9 


27.7±4.2/26.9±5.1 


Ilium. Exome 


226 


F4 


2721 (1294/1427) 


55.5±13.2/54.8±13.1 


27.7±4.0/26.9±5.1 


Ilium. Metabochip 


135 


F3 


2889 (1393/1496) 


56.6± 12.9/56.1 ±12.5 


27.8±3.7/27.0±5.0 






F4 


2817 (1338/1479) 


55.5±13.2/54.9±13.1 


27.7±4.1/27.0±5.1 



The mtSNPs column shows the number of mitochondrial SNPs provided by each chip. Distributions are presented as means ± standard deviation. 
doi:10.1371/journal.pone.0105116.t003 
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Figure 2. Coverage of the mitochondrial genome provided by each genotyping chip. Each vertical bar stands for one mtSNP. The x-axis 
represents the mitochondrial genome, displaying the position and relative size of each of the 13 major mitochondrial genes, 12S and 16S rRNAs, 
hypervariable region 1 (HVRI), hypervariable region 2 (HVR II) as well as the position of the 22 tRNAs. 
doi:l 0.1 371 /journal.pone.01 051 1 6.g002 



described in Text S 1 . A summary of the quality control results is 
given in Table 1. 

Statistical methods 

To approach the heteroplasmy present in the mitochondria we 
used the raw signals of luminous intensity, where every measure- 
ment is associated with a specific mtSNP and represents one of its 
alleles [48,49] . The number of measures n per mtSNP depends on 
the vendor-specific technology employed on the genotyping chip. 
At the very least there have to be two signals, one for each of the 
two alleles. Often, however, the chip design includes more than 
one measurement per SNP and allele. That is, for every individual 
and SNP we have intensity measurements (Aj, Bi), (A^, B„) 
with n> = 1 where A; and B; represent the intensities of the two 
alleles A and B. The best way to assess the association of BMI with 
the mtSNP intensities at a given age is to apply linear regression 
analysis using BMI as quantitative response variable. The mtSNP 
enters the model as a covariate via the log2-transformed intensity 
ratio, log2(A/B), where A and B denote the mean intensity, or 
single m('asure in case of n = 1 , for the .4 allck' and B allele, 
respectively. To improve the convergence properties of the model 
estimates we center this variable (z = log2(A/B) — |i) as well as the 
additional quantitative covariate age at examination, i.e., the age 
at the time that the mtSNPs and BMI were measured. Sex is also 
introduced in the model as a covariate. The linear regression 
model for the ith-mtSNP is: 

BMI; = Po + PiZy + P2^gSj + P^seXj + ey f ory = 1 . . . m 

where J denotes the individual and m is the sample size. The aim of 
this study is to evaluate how BMI is influenced by mitochondrial 
variants taking into account the age for a particular person on the 
phenotype BMI, and therefore the model adjusts for individual age 
differences in the study population. Using this model it is not 
possible to make any claims based on the effect of age on 
heteroplasmy at the particular mtSNP. However, due to the 
adjustment by age on BMI in the model, we can be sure that in 
case of positive findings the BMI change is in fact caused by 
variations of heteroplasmy at the particular mtSNP, and not by 
differences in age that jointly influence BMI and heteroplasmy, 
without an effect of heteroplasmy on BMI. 



P-values were obtained from the Wald test, which is based on 
the asymptotic normality of the regression coefficient estimates, 
and corrected for multiple comparisons applying the Benjamini- 
Hochberg false discovery rate (FDR) using the R function p. adjust 
[50]. The resulting adjusted P-values are also known as "Q; 
values". Each type of genotyping chip needs to be analyzed 
separately because different chips make use of difierent technol- 
ogies, even between chips of the same manufacturer. The two 
Affymetrix chips, AffyG.O and Axiom, share 170 positions and 
lUumina chips, Exome and Metabochip, share 44 positions. Only 
9 positions are common to all four chips. Overlapping mtSNPs 
were analysed separately for each respective genotyping chip, 
giving in this way the opportunity of vaUdaUon in the case of 
significant results. AH the analyses were performed with the R 
statistical software [51]. 

Supporting Information 

Figure SI Examples of intensities affected by the cut- 
off. 

(TIFF) 

Figure S2 Resulting p-values from the 1** analysis (left 
side) and 2"*' analysis (right side). 

(TIFF) 

Table SI Estimates of the model parameters for 
mtSNPs with a P„omii.ai< — O-Ol. Genomic position in base 
pairs (bp), alleles, rs_number, and point mutation are based on the 
NCBI dbSNP GRCh38 human genome assembly (rCRS, 
GeneBank ID J01415.2). Alleles are given in terms of major-^mi- 
nor allele. An estimated effect size (Psnp)<0 indicates that the risk 
allele is the minor allele. Nominal p-values and adjusted p-values 
are provided. \l: mean of log2(A/B). Covariate sex baseline: male. 
(DOC) 

Text SI Quality control. 

(DOCX) 
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